arXiv:cs/0503006vl [cs.IT] 2 Mar 2005 


SUBMITTED TO ISIT 2005 ON 31 JANUARY 2005 


1 


A New Non-Iterative Decoding Algorithm for the 
Erasure Channel : Comparisons with Enhanced 

Iterative Methods 

J. Cai, C. Tjhai, M. Tomlinson, M. Ambroze and M. Ahmed 
Fixed and Mobile Communications Research 
University of Plymouth 
PL4 8AA, United Kingdom 

email: {jcai,ctjhai,mtomlinson,mambroze,mahmed}@plymouth.ac.uk 


Abstract —This paper investigates decoding of binary linear 
block codes over the binary erasure channel (BEC). Of the 
current iterative decoding algorithms on this channel, we review 
the Recovery Algorithm and the Guess Algorithm. We then 
present a Multi-Guess Algorithm extended from the Guess 
Algorithm and a new algorithm - the In-place Algorithm. The 
Multi-Guess Algorithm can push the limit to break the stopping 
sets. However, the performance of the Guess and the Multi- 
Guess Algorithm depend on the parity-check matrix of the code. 
Simulations show that we can decrease the frame error rate by 
several orders of magnitude using the Guess and the Multi-Guess 
Algorithms when the parity-check matrix of the code is sparse. 
The In-place Algorithm can obtain better performance even if the 
parity check matrix is dense. We consider the application of these 
algorithms in the implementation of multicast and broadcast 
techniques on the Internet. Using these algorithms, a user does 
not have to wait until the entire transmission has been received. 


I. Introduction 

The Binary Erasure Channel (BEC) was introduced by Elias 
[1] in 1955. It counts lost information bits as being “erased” 
with probabilities equal to 0.5. Currently, the BEC is widely 
used to model the Internet transmission systems, in particular 
multicasting and broadcasting. 

As a milestone, Luby et. al. [2] proposed the first realization 
of a class of erasure codes - LT codes, which are rateless and 
are generated on the fly as needed. However, LT-codes cannot 
be encoded with constant cost if the number of collected 
output symbols is close to the number of input symbols. In [3], 
Shokrollahi introduced the idea of Raptor codes which adds 
an outer code to LT codes. Raptor codes have been established 
in order to solve the error floors exhibited by the LT codes. 

On the other hand, low-density parity-check (LDPC) codes 
have been studied [5] to [8] for application to the BEC. The 
iterative decoding algorithm, which is the same as Gallager’s 
soft-decoding algorithm [9], was implemented [5]. Capacity- 
achieving degree distributions for the binary erasure channel 
have been introduced in [5], [6] and [7]. Einite-length analysis 
of LDPC codes over the BEC was accomplished in [8]. In that 
paper, the authors have proposed to use finite-length analysis 
to find good finite-length codes for the BEC. 

In this paper, we show the derivation of a new decoding 
algorithm to improve the performance of binary linear block 


codes on the BEC. The algorithm can be applied to any linear 
block code and is not limited to LDPC codes. Starting with 
superposition of the erased bits on the parity-check matrix, we 
review the performance of the iterative decoding algorithms, 
described in the literature, for the BEC, principally the Re¬ 
covery Algorithm and the Guess Algorithm [10]. In Section 
irm we propose an improvement to the Guess Algorithm 
based on multiple guesses: the Multi-Guess Algorithm and 
give a method to calculate the minimum number of guesses 
required in the decoding procedure. In this section, we also 
describe a new, non iterative decoding algorithm based on 
a Gaussian-Reduction method [11] by processing the parity- 
check matrix. In Section EYl we compare the performance of 
these algorithms for different codes using computer simula¬ 
tion. In Section|V] we discuss the application of these decoding 
algorithms for the Internet. Section iyh concludes the paper. 

11. Preliminaries 

A. Matrix Representations of the Erased Bits 

Let H denote the parity-check matrix. Considering an Lx N 
binary linear block code, we assume that the encoded sequence 
is X = {xi,X 2 , ■ ■ ■ ,xn}- After being transmitted over the 
erasure channel with erasure probability e, the encoded se¬ 
quence can be divided into the transmitted sub-sequence and 
the erased sub-sequence, denoted as y = {j/i, j/ 2 , ■ ■ ■, yzi} and 
ye = {yei, ye 2 , ■ ■ ■, Veh} respectively, where li + I 2 = N. 

Corresponding to the parity check matrix of the code, we 
can generate an erasure matrix {L^ x N) which contains 
the positions of the erased bits in H. Then we denote the set 
of erased bits i that participate in each parity check row by 
Ei = {j : = 1} with h standing for “horizontal” and 

the number of erased bits in is denoted by \E^\. Similarly 
we define the set of checks in which bit j participates, 
Ej={i: = 1} with v standing for “vertical”, and 

the number of erased bits in UJ is denoted by \Ej\. Let 
Eh = {E^ I i e {1,2,...,LJ} and E" = {E- \ j G 
{1,2,...,A}}. The matrix representation is shown in Pig. 
[2 where an “x” represents an erasure. 

B. Current Iterative Decoding Algorithms for the BEC 

In [5], the message-passing algorithm was used for reliable 
communication over the BEC at transmission rates arbitrarily 
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Fig. 1. A matrix representation of the erased bits 

close to channel capacity. The decoding algorithm succeeds if 
and only if the set of erasures do not cause stopping sets [8]. 
For completeness, this algorithm is briefly outlined below: 
Recovery Algorithm 

• step 1 Generate the and obtain the 

• step 2 For i G {1,2,..., if \E^ \ — 1, we replace the 
value in the bit position i with the XOR of the unerased 
bits in that check equation. Then we remove the erasure 
from the erasure matrix. 

• step 3 Continue from step 2 until all the erased bits are 
solved or the decoding cannot continue further. 

The decoder will fail if stopping sets exist. 

We can break the stopping sets by performing several 
“guesses” of the unsolved erased bits. This algorithm is called 
the Guess Algorithm [10]. 

Guess Algorithm 

• Step 1 Run the decoder with Recovery Algorithm until it 
fails due to stopping set(s). 

• Step 2 In order to break the stopping set, when \E!^\ = 
2, we guess one of the erased symbols and update the 
erasure matrix and E*^. 

• Step 3 Continue from step 1 until all the erased symbols 
are solved or the decoding cannot continue further. If the 
decoder cannot continue, declare a decoder failure and 
exit. 

• Step 4 Creat a list of 2® solutions, where g is the number 

of guesses made. From the list Coutt, , k € {1, 2,..., 2®}, 
pick the one that satisfies = 0. 

Obviously, compared to the Recovery Algorithm, the com¬ 
plexity of this algorithm increases with g. Usually, we limit the 
number of guesses to a small number g^. If after g^ guesses, 
the decoding still cannot be finished, a decoding failure is 
declared. For sparse codes with low-density El, e. g. LDPC 
codes, the Guess Algorithm can improve the performance with 
g < 3 guesses as shown in Section 113 

The decoding algorithm is more efficient when the bits to be 
guessed are carefully chosen. These are termed “crucial”bits. 
The crucial bits are chosen on the basis of the highest value 
of |EJ| with the value of \Ej\ = 2. 

III. Improved Decoding Algorithms eor 
Non-sparse Linear Block Codes for the BEC 

For non-sparse linear codes, it is common to encounter more 
than 2 unsolved symbols in each row of after running the 


Guess Algorithm, due to the high-density of their parity check 
matrix. In these cases, we cannot break the stopping set by 
guessing one erased symbol in a row only. More than 1 erased 
symbols at one time need to be guessed. We can calculate the 
minimum number of guesses before the decoding. 

Lemma 3.1: Consider the chosen erased symbols in each 
row as an erased group. Let ujs denote the set of rows with 5 
erasures, that is, ojs = {i \ \E^\ = i5}. And xs is the set of 
rows which satisfies: 

xs = {i & UJS \ 3fc,p G Ef,such as k p, |Efc| = \Ep \ = 1}. 

(1) 

Then 

minp = |a; 5 | -I- 1 (2) 

where 1 accounts for the need for at least one “crucial” row. 

Proof: When the guessing process stops, there are more 
than 2 erased symbols in each erased row. The rows that have 
more than two bits {k,p) which do not participate in any other 
row (i. e. \E'^\ = \E'!^\ = 1) cannot be solved by other rows, 
and so at least one of these bits has to be guessed. So the 
minimum number of guesses equals to the number of all the 
independent guesses plus one more “crucial” guess to solve 
the other rows. ■ 

For the Multi-Guess Algorithm, a whole row is guessed. A 
crucial row c is defined as follows: 

1 ) C&US 

2) J2jeEh \^j\ maximized over c in uis 
The Multi-Guess Algorithm is given below: 

Multi-Guess Algorithm 

> step 1 Run the decoder with Guess Algorithm until 

\E^\ > 2 for i 

• step 2 Evaluate the value of min(( 7 ). If min(( 7 ) > the 
decoding declares a failure and exits. 

• step 3 Group the rows with \E^\ = 5 as ujs, where i G 

{1, 2,..., Le}. 

• step 4 Eind the “crucial” row and guess all erased bits in 
that row. (There will be at most guesses.) 

• step 5 Guess one bit p with \Ep\ = 1 in each of the 
independent rows, i.e. the rows in Xg. 

• step 6 Update M^, E*^ and E''. Continue the decoding 
from step 3 to step 5 until all the erased bits are solved 
or the decoding cannot continue further. 

The disadvantages of Guess and Multi-Guess Algorithms 
include the decoding complexity and the correctness of the 
results. The decoding complexity grows exponentially with the 
number of guesses. It is possible that the group guess declares 
a wrong value as the result of the decoder. Although this kind 
of situation happens only when the value of e is very small, 
it is still undesirable. 

Let x' denote the received vector, where x' = yljyc. 
We now devise a reduced complexity algorithm to decode 
the erased bits by solving the equation 0 using the Gaussian 
Reduction method [11]. 

iFx''^ = 0. (3) 

According to [8], the optimal decoding is equivalent to solving 
the linear system, shown in the equation |3 If the equation 
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13 has a unique solution, the optimal algorithm is possible. 
Guassian Reduction algorithm is considered as the optimal 
algorithm over the EEC. We propose a reduced complexity 
Guassian Reduction algorithm - In-place Algorithm [4] by 
elimilating the column-permutations required. This algorithm 
is stated as follows; 

In-place Algorithm 

• step 1 The codeword is received and are substituted 
in positions of erased bits in H. Starting with one of 
the erased symbols, the first equation containing this 
symbol is flagged that it will be used for the solution of 

. This equation is subtracted from all other equations 
containing and not yet flagged to produce a new set of 
equations. The procedure repeats until either non flagged 
equations remain containing y^^ (in which case a decoder 
failure is declared) or no erased symbols remain that are 
not in flagged equations. 

• step 2 Let be the erased symbols at the last flagged 
equations. In the latter case, starting with this equa¬ 
tion is solved to And j/iast and this equation is unflagged. 
This coefficient is substituted back into the remaining 
flagged equations containing yiast. The procedure now 
repeats with the second from last flagged eqaution now 
being solved for t/g,^„_i. This equation is unflagged and 
followed by back substitution of t/iast-i for t/g,,„,_i in the 
remaining flagged equations. 



Fig. 2. Erasure Correction Using In-place Algorithm 

A block schematic of the decoder is shown in Fig|2l The 
received bits are stored in the shift register with the erased 
bits being replaced by the unknown j/g. The Gaussian reduced 
equations are computed and used to define the connection of 
bit adders from the respective shift register stage to compute 
the outputs di to dn- The non erased symbols contained in the 
shift register are switched directly through to the respective 
output so that the decoded codeword with no erased bits is 
present at the outputs di through to dn- 


IV. Results 

We evaluated the performance of the Recovery Algorithm 
with the LT codes with Soliton distribution as described in 
[2] and irregular LDPC codes. As shown in Fig. |3 the 
performance of irregular LDPC codes is significantly better 
than that of the LT codes for the same block length. As a 
consequance, we use LDPC codes to benchmark the remaining 
algorithms. 


0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.6 0.65 0.7 

Coderale(R) 

Fig. 3. Performance of the LT codes and irregular LDPC codes with erasure 
probability = 0.2 

A particularly strong binary code and which has a sparse H 
is the cyclic LDPC code (255,175), which has a length of 255 
bits after encoding of 175 information bits. Since the parity- 
check polynomial of the (255,175) ' code is orthogonal on 
every bit position, the minimum Hamming distance is 1 -f w, 
where w denotes the number of ones per row in H [12]. 

The applicability of the decoding methods above depends 
on the error correcting code being used and specifically on 
the parity check matrix being used. The performance of this 
code for the Recovery, the Guess and the In-place Algorithms 
is shown in Fig. 0] in terms of the probability of decoder 
error (FER) as a function of the erasure probability for every 
transmitted bit. 



Fig. 4. Performance of the Cyclic LDPC (255,175) with the Guess , the 
Multi-Guess and the In-place Algorithms 

Due to its sparse parity check matrix. Guess algorithm with 
less than 3 guesses can achieve more than 1 order of magnitude 
improvement compared to that of Recovery Algorithm. In 
addition, from Fig. |4] we also can see that the curve of Guess 
Algorithm is very close to the curve of In-place Algorithm, 
which means Guess Algorithm is a “near optimal decoding” 
algorithm when it has a sparse parity check matrix. 

Fig.ia shows the performance of the (341,205) LDPC code 
^ with the Recovery, the Guess, the Multi-Guess and the In- 
place Algorithms. Comparing these results of the Recovery 
and Guess Algorithms, the Multi-Guess Algorithm can obtain 
the results by several orders of magnitude better. For example, 

*The (255,175) Cyclic LDPC code has a minimum Hamming distance of 
17. 

^The (341,105) LDPC code has a minimum Hamming distance of 16. 
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Fig. 5. Performance of the Cyclic LDPC (341,205) with the Recovery, the 
Guess, the Multi-Guess and the In-place Algorithms 


when the erasure probability equals to 0.3, the Multi-Guess 
Algorithm with pmax = 3 is one order of magnitude better 
than the Recovery and Guess Algorithms, when gmax = 5, 
the Multi-Guess Algorithm is 2 order2 of magnitude better 
than the Recovery and the Guess Algorithms. As an optimal 
decoding algorithm, the In-place Algorithm can achieve 4 
orders of magnitude better than the Recovery and the Guess 
Algorithm. 

The ultimate performance of the In-place Algorithm as a 
function of error correcting code is shown in Fig. for the 
example (255,175) code which can correct a maximum of 80 
erased bits. Fig.l^shows the probability density function of the 
number of erased bits short of the maximum correctable which 
is N — L. The results were obtained by computer simulations. 
The probability of being able to correct only 68 bits, a shortfall 
of 12 bits, is 1.1 X 10“^. Simulations indicate that on average 
77.6 erased bits may be corrected for this code. In comparison 
the BCH (255,178) code having similar rate is also shown in 
Fig. El The BCH code has a similar rate but a higher minimum 
Hamming distance of 22 (compared to 17). It can be seen 
that it has better performance than the (255,175) code but it 
has a less sparse parity check matrix and consequently it is 
less suitable for Recovery Algorithm and Guess Algorithm. 
Moreover the average shortfall in erasures not corrected is 
virtually identical for the two codes. 



Fig. 6. Comparison of Probability Distribution of Number of Erased Bits 
not Connected from Maximum Correctible (N-L) for (255,175) code and BCH 
(255,178) code 

The simulation results of using In-place Algorithm for the 
(103,52) quadratic residue binary code [13] are shown in Fig. 
0 The minimum Hamming distance for this code is 19 and the 


results are similar to that of the (255,178) BCH code above. 
It is found from the simulations that on average 49.1 erasure 
bits are corrected (out of a maximum of 51) and the average 
shortfall from the maximum is 1.59 bits. 



Number ol«rsswl bits stuft'Of n-k 


Fig. 7. Probability Distribution of Number of Erased Bits not Corrected 
from Maximum Correctible (N-L) for (103,52) code quadratic redisue code 


Similarly the results for the extended BCH (128,64) code is 
shown in Fig.|3 This code has a minimum Hamming distance 
of 22 and has a similar probability density function to the other 
BCH codes above. On average 62.39 erasure bits are corrected 
(out of a maximum of 64) and the average shortfall is 1.61 
bits from the maximum. 



Fig. 8. Probability Distribution of Number of Erased Bits not Corrected 
from Maximum Correctible (N-L) for (128,64) extended BCH code 


V. Application 

In multicast and broadcast information is transmitted in data 
packets with typical lengths from 30 bits to 1000 bits. These 
packets could define a symbol from a Galois field [12], viz 
GF{2^) but with m equal to 30 or more up to and beyond 
1000 bits this is impracticable and it is more convenient to use 
a matrix approach with the packets forming the rows of the 
matrix and columns of bits encoded using an error correcting 
code. Usually, but not essentially the same code would be used 
to encode each column of symbols. The matrix of symbols 
may be defined as: 

There are a total of (s -f 1) • fc information symbols which 
encoded using the parity check equations of a selected code 
into a total number of transmitted symbols equal to (s -f 1) • 
n. The symbols are transmitted in a series of packets with 
each packet corresponding to a row as indicated above. For 
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booboibo2bo3 ■ ■ - bos = packet 1 

6lof'll^>l2f'l3 ■ • ■ bis = packet 2 

b2ob2lb22b23 ■ ■ ■b2s = packet 3 


bn —lobn—llbn—12bn —13 • • • bn —Is — packet 11 


example the row: & 20 & 21 & 22&23 .^ 2 s is transmitted as a 

single packet. 

Self contained codewords are encoded from each column of 
symbols. For example 5oofcio^20.&fc-io form the infor¬ 

mation symbols of one codeword and the remaining symbols, 

bk+obk+wbk +20 .fen -10 are the parity symbols of that 

codeword. As a result of network congestion, drop outs, loss 
of radio links or other multifarious reasons not all of the 
transmitted packets are received. The effect is that some rows 
above are erased. The decoding procedure is that codewords 
are assemble from the received packets with missing symbols 
corresponding to missing packets marked as Zij. For example, 
if the second packet only is missing above: 

• The first received codeword corresponds to the first 

column above and is feoo-Ziofe20 . fen-10 

• The second codeword corresponding to the first column 

above and is feoi2iife2i.fen-11 and so on. 

All the algorithms stated in Section 2 may be used to solve 
for the erased symbols zio in the first received codeword, and 
for the erased symbol 211 in the second received codeword and 
so on up to the s'th codeword (column) solving for zis-i- 

As an example, the binary, extended (128, 64) BCH code 
could be used to encode the information data. The packet 
length is chosen to be 100 bits, and the total transmission 
could consist of 128 transmitted packets (12,800 bits total) 
containing 6,400 bits of information. On average as soon 
as any 66 packets from the original 128 packets have been 
received, the remaining 62 packets are treated as if they 
are erased. 100 codewords are assembled, decoded with the 
erasures solved and the 6,400 bits of information retrieved. 
One advantage is that a user does not have to wait until the 
entire transmission has been received. 

VI. Conclusions 

In this paper, we presented different decoding algorithms 
of LDPC codes over the BEC: Recovery, Guess, Multi-Guess 


and In-place Algorithms. The Multi-Guess Algorithm is an 
extension to Guess Algorithm, which can push the limit to 
break the stopping sets. We show that Guess and Multi- 
Guess Algorithms are parity-check matrix dependent. For the 
codes with sparse parity-check matrix. Guess and Multi-Guess 
Algorithms can be considered as “Near-optimal Decoding 
Methods”. On the other hand. In-place Algorithm is not. It’s 
an optimal method for the BEC and able to correct N — L — p 
erasures, where p is a small positive integer. 

We also considered these algorithms in the implementation 
of multicast and broadcast. Using these algorithms, a user does 
not have to wait until the entire transmission has been received. 
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